Abstract

This paper considers linear quadratic team decision problems where the players in 
the team affect each other's information structure through their decisions. Whereas the 
stochastic version of the problem is well known to be complex with nonlinear optimal 
solutions that are hard to find, the deterministic counterpart is shown to be tractable. 
We show that under a mild assumption, where the weighting matrix on the controller is 
chosen large enough, linear decisions are optimal and can be found efficiently by solving a 
semi-definite program. 
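Notation

$\mathbb{S}^n$          The set of $n \times n$ symmetric matrices.

$\mathbb{S}^n_{+}$      The set of $n \times n$ symmetric positive semidefinite matrices.

$\mathbb{S}^n_{++}$     The set of $n \times n$ symmetric positive definite matrices.

$\mathcal{C}$           The set of functions $\mu : \mathbb{R}^p \to \mathbb{R}^m$ with
                        $\mu(y) = (\mu_1(y_1), \mu_2(y_2), \dots, \mu_N(y_N))$,
                        $\mu_i : \mathbb{R}^{p_i} \to \mathbb{R}^{m_i}$, $\sum_i m_i = m$, $\sum_i p_i = p$.

$\mathbb{K}$            $\{K \in \mathbb{R}^{m \times p} \mid K = \bigoplus_i K_i, \ K_i \in \mathbb{R}^{m_i \times p_i}\}$.

$A^{\dagger}$           Denotes the pseudo-inverse of the square matrix $A$.

$A^{\perp}$             Denotes the matrix with minimal number of columns spanning the nullspace of $A$.

$A_i$                   The $i$th block row of the matrix $A$.

$A_{ij}$                The block element of $A$ in position $(i, j)$.

$\succeq$               $A \succeq B \iff A - B \in \mathbb{S}^n_{+}$.

$\succ$                 $A \succ B \iff A - B \in \mathbb{S}^n_{++}$.

$\mathbf{Tr}$           $\mathbf{Tr}[A]$ is the trace of the matrix $A$.

$\mathcal{N}(m, X)$     The set of Gaussian variables with mean $m$ and covariance $X$.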
I. Introduction

The team problem is an optimization problem, where a number of decision makers 
(or players) make up a team, optimizing a common cost function with respect to some 
uncertainty representing nature. Each member of the team has limited information 
about the global state of nature. Furthermore, the team members could have different 
pieces of information, which makes the problem different from the one considered in 
classical optimization, where there is only one decision function that has access to 
the entire information available about the state of nature. 

Team problems seemed to possess certain properties that were considerably dif- 
ferent from standard optimization, even for specific problem structures such as the 
optimization of a quadratic cost in the state of nature and the decisions of the 
team members. In stochastic linear quadratic decision theory, it was believed for a 
while that certainty-equivalence holds between estimation and optimal decision with
complete information, even for team problems. The certainty-equivalence principle 
can be briefly explained as follows. First assume that every team member has access 
to the information about the entire state of nature, and find the corresponding 
optimal decision for each member. Then, each member makes an estimate of the 
state of nature, which is in turn combined with the optimal decision obtained from 
the full information assumption. It turns out that this strategy does not yield an 
optimal solution (see [9]).

A general solution to static stochastic quadratic team problems was presented by 
Radner [9]. Radner's result gave hope that some related problems of dynamic nature
could be solved using similar arguments. But in 1968, Witsenhausen [11] showed in
his well-known paper that finding the optimal decision can be complex if the decision
makers affect each other's information. Witsenhausen considered a dynamic decision
problem over two time steps to illustrate that difficulty. The dynamic problem can
actually be written as a static team problem:

$$
\begin{aligned}
\text{minimize} \quad & \mathbf{E}\,\{k^2 u_0^2 + (x + u_0 - u_1)^2\} \\
\text{subject to} \quad & u_0 = \mu_0(x), \quad u_1 = \mu_1(x + u_0 + w),
\end{aligned}
$$

where $x$ and $w$ are Gaussian with zero mean and variance $X$ and $W$, respectively.
Fig. 1. Coding-decoding diagram over a Gaussian channel.

Here, we have two decision makers, one corresponding to $u_0$, and the other to $u_1$.
Witsenhausen showed that the optimal decisions $\mu_0$ and $\mu_1$ are not linear because of
the signaling/coding incentive of $u_0$. Decision maker $u_1$ measures $x + u_0 + w$, and hence,
its measurement is affected by $u_0$. Decision maker $u_0$ tries to encode information
about $x$ in its decision, which makes the optimal strategy complex.

The problem above is actually an information theoretic problem. To see this, consider
the slightly modified problem

$$
\begin{aligned}
\text{minimize} \quad & \mathbf{E}\,(x - u_1)^2 \\
\text{subject to} \quad & u_0 = \mu_0(x), \quad \mathbf{E}\,u_0^2 \le 1, \quad u_1 = \mu_1(u_0 + w).
\end{aligned}
$$

The modification made is that we removed $u_0$ from the objective function, and instead
added a constraint $\mathbf{E}\,u_0^2 \le 1$ to make sure that it has a limited variance (of
course we could set an arbitrary power limitation on the variance). The modified
problem is exactly the Gaussian channel coding/decoding problem (see Figure 1)!
The optimal solution to Witsenhausen's counterexample is still unknown. Even if we
restrict the optimization problem to the set of linear decisions, there is still
no known polynomial-time algorithm to find optimal solutions. Another interesting
counterexample was recently given in [7].

In this paper, we consider the problem of distributed decision making with information
constraints under linear quadratic settings. For instance, information constraints
appear naturally when making decisions over networks. These problems
can be formulated as team problems. Early results considered static team theory 
in stochastic settings (Si, (U, |[5]|. In (SI, the team problem with two team members 
was solved. The solution cannot be easily extended to more than two players since 
it uses the fact that the two members have common information; a property that 
doesn't necessarily hold for more than two players. [12] uses the result to consider the
two-player problem with one-step delayed measurement sharing with the neighbors, 
which is a special case of the partially nested information structure, where there 
is no signaling incentive. Also, a nonlinear team problem with two team members 
was considered in IS]], where one of the team members is assumed to have full 
information whereas the other member has only access to partial information about 
the state of the world. Related team problems with exponential cost criterion were 
considered in [6]. Optimizing team problems with respect to affine decisions in a 
minimax quadratic cost was shown to be equivalent to stochastic team problems 
with exponential cost, see (S]]. The connection is not clear when the optimization is 
carried out over nonlinear decision functions. In [4], a general solution was given 
for an arbitrary number of team members, where linear decisions were shown to be
optimal and can be found by solving a linear matrix inequality. In the deterministic 
version of Witsenhausen's counterexample, that is minimizing the quadratic cost 
with respect to the worst case scenario of the state x (instead of the assumption that 
$x$ is Gaussian), linear decisions were shown to be optimal in [10].

We will show that for static linear quadratic minimax team problems, where the 
players in the team affect each other's information structure through their decisions,
linear decisions are optimal in general, and can be found by solving a linear matrix 
inequality. 
II. Main Results

The deterministic problem considered is a quadratic game between a team of 
players and nature. Each player has limited information that could be different from 
the other players in the team. This game is formulated as a minimax problem, where 
the team is the minimizer and nature is the maximizer. We show that if there is a 
solution to the static minimax team problem, then linear decisions are optimal, and 
we show how to find a linear optimal solution by solving a linear matrix inequality. 



III. Deterministic Team Problems with Signaling Incentive 
Consider the following team decision problem 

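$$
\begin{aligned}
\inf_{\mu \in \mathcal{C}} \ \sup_{v,\, w \neq 0} \quad & \frac{L(w, u)}{\|w\|^2 + \|v\|^2} \\
\text{subject to} \quad & y_i = \sum_{j=1}^{N} D_{ij} u_j + E_i w + v_i, \\
& u_i = \mu_i(y_i), \quad \text{for } i = 1, \dots, N,
\end{aligned}
\tag{1}
$$

where $u_i \in \mathbb{R}^{m_i}$ and $E_i \in \mathbb{R}^{p_i \times q}$, for $i = 1, \dots, N$, and
$L(w, u)$ is a quadratic cost given by

$$
L(w, u) =
\begin{bmatrix} w \\ u \end{bmatrix}^T
\begin{bmatrix} Q_{ww} & Q_{wu} \\ Q_{uw} & Q_{uu} \end{bmatrix}
\begin{bmatrix} w \\ u \end{bmatrix},
$$

with $Q_{uu} \in \mathbb{S}^m_{++}$, $m = m_1 + \cdots + m_N$, and

$$
\begin{bmatrix} Q_{ww} & Q_{wu} \\ Q_{uw} & Q_{uu} \end{bmatrix} \in \mathbb{S}^{m+q}.
$$

The players $u_1, \dots, u_N$ make up a team, which plays against nature represented
by the vector $w$, using $\mu \in \mathcal{C}$. This problem is more complicated than the static
team decision problem studied in [4], since it has the same flavour as that of the
Witsenhausen counterexample that was presented in the introduction. We see that
the measurement $y_i$ of decision maker $i$ could be affected by the other decision makers
through the terms $D_{ij} u_j$, $j = 1, \dots, N$.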
Note that we have the equality $y = Du + Ew + v$, which is equivalent to
$v = y - Du - Ew$. Using this substitution of variable, the team problem (1) is equivalent
to

$$
\inf_{\mu \in \mathcal{C}} \ \sup_{y,\, w \neq 0} \ \frac{L(w, \mu(y))}{\|D\mu(y) + Ew - y\|^2 + \|w\|^2}. \tag{2}
$$

For finite measurements $y$, one would typically like to punish large values for the
decision function $\mu$. So, having large values $u$ for the decision functions should not
get us optimality. Therefore, we will make the following mild assumption on the
design of the weighting matrix $Q_{uu}$.
Assumption 1: $Q_{uu}$ is chosen large enough, such that

$$
\bar{\gamma} = \sup_{w \neq 0} \frac{w^T Q_{ww} w}{\|w\|^2} < \inf_{Du \neq 0} \frac{u^T Q_{uu} u}{\|Du\|^2}.
$$
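Assumption 1 can be checked numerically for given data. The following is a minimal sketch, assuming NumPy is available and that $Q_{uu} \succ 0$; the function name `assumption_1_holds` and the numerical data are hypothetical and chosen only for illustration. It uses the facts that the supremum on the left is the largest eigenvalue of $Q_{ww}$, and that the infimum on the right is the reciprocal of the largest eigenvalue of $L^{-1} D^T D L^{-T}$, where $Q_{uu} = L L^T$ is a Cholesky factorization.

```python
import numpy as np

def assumption_1_holds(Qww, Quu, D):
    """Return (gamma_bar, bound, ok) for the inequality in Assumption 1."""
    # gamma_bar = sup_{w != 0} w'Qww w / |w|^2 is the largest eigenvalue of Qww.
    gamma_bar = float(np.linalg.eigvalsh(Qww).max())
    # With Quu = L L' and u = L^{-T} z, the ratio |Du|^2 / u'Quu u becomes
    # z' (L^{-1} D'D L^{-T}) z / |z|^2, whose supremum is an eigenvalue.
    L = np.linalg.cholesky(Quu)          # fails if Quu is not positive definite
    Linv = np.linalg.inv(L)
    lam = float(np.linalg.eigvalsh(Linv @ D.T @ D @ Linv.T).max())
    # inf_{Du != 0} u'Quu u / |Du|^2 = 1 / lam (infinite when D = 0).
    bound = np.inf if lam <= 1e-12 else 1.0 / lam
    return gamma_bar, bound, gamma_bar < bound

# Hypothetical data: two players, player 2 measures the decision of player 1.
Qww = np.array([[1.0]])
D = np.array([[0.0, 0.0], [1.0, 0.0]])
print(assumption_1_holds(Qww, 4.0 * np.eye(2), D))   # large weight on u
print(assumption_1_holds(Qww, 0.5 * np.eye(2), D))   # weight too small
```

In this toy instance the bound equals the scalar weight on $u$, so the assumption holds for the weight 4 and fails for the weight 0.5.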



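Theorem 1: Let $\gamma^*$ be the value of the game (1) and $\bar{\gamma} = \sup_{w \neq 0} \frac{w^T Q_{ww} w}{\|w\|^2}$. Suppose
that Assumption 1 holds. Then the following statements hold:

(i) The game (1) achieves the value $\bar{\gamma}$ for $\mu(y) = 0$, and $\gamma^* \le \bar{\gamma}$.

(ii) There exist linear decisions $\mu_i(y_i) = K_i y_i$, $i = 1, \dots, N$, where the value $\gamma^*$ is
achieved.

(iii) If $\gamma^* < \bar{\gamma}$, then for any $\gamma \in [\gamma^*, \bar{\gamma})$, a linear decision $Ky$ with $K \in \mathbb{K}$ that achieves
$\gamma$ is obtained by solving the linear matrix inequality

$$
\begin{aligned}
\text{find} \quad & K \\
\text{subject to} \quad & K = \operatorname{diag}(K_1, \dots, K_N), \\
& C = \begin{bmatrix} 0 & I \end{bmatrix}, \\
& \begin{bmatrix} Q_{xx}(\gamma) & Q_{xu}(\gamma) \\ Q_{ux}(\gamma) & Q_{uu}(\gamma) \end{bmatrix}
= \begin{bmatrix} Q_{ww} & 0 & Q_{wu} \\ 0 & 0 & 0 \\ Q_{uw} & 0 & Q_{uu} \end{bmatrix}
- \gamma \begin{bmatrix} E^T E + I & -E^T & E^T D \\ -E & I & -D \\ D^T E & -D^T & D^T D \end{bmatrix}, \\
& \begin{bmatrix} Q_{xx}(\gamma) + Q_{xu}(\gamma) K C + C^T K^T Q_{ux}(\gamma) & C^T K^T \\ K C & -Q_{uu}^{-1}(\gamma) \end{bmatrix} \preceq 0.
\end{aligned}
$$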
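For a candidate gain $K$, the matrix inequality in Theorem 1 (iii) can be tested directly by assembling the block matrix and inspecting its largest eigenvalue. The following is a minimal sketch, assuming NumPy; the function names `lmi_matrix` and `achieves` and the scalar test instance are hypothetical. The quadratic form of the denominator is built from its definition $\|y - Du - Ew\|^2 + \|w\|^2$ in the coordinates $(w, y, u)$. The block-diagonal structure of $K$ is left to the caller, and finding $K$ (rather than checking it) would require a semidefinite programming solver, which is not shown here.

```python
import numpy as np

def lmi_matrix(Qww, Qwu, Quu, D, E, K, gamma):
    """Assemble the block matrix of the LMI in Theorem 1 (iii) for a given K."""
    q, m = Qwu.shape          # w has q entries, u has m entries
    p = D.shape[0]            # y has p entries
    n = q + p                 # x = (w, y)
    # Cost L(w, u) as a quadratic form in (w, y, u); y carries no cost.
    Q = np.zeros((n + m, n + m))
    Q[:q, :q], Q[:q, n:] = Qww, Qwu
    Q[n:, :q], Q[n:, n:] = Qwu.T, Quu
    # |y - Du - Ew|^2 + |w|^2 as a quadratic form in (w, y, u).
    G = np.hstack([-E, np.eye(p), -D])
    R = G.T @ G
    R[:q, :q] += np.eye(q)
    S = Q - gamma * R
    Sxx, Sxu, Suu = S[:n, :n], S[:n, n:], S[n:, n:]
    C = np.hstack([np.zeros((p, q)), np.eye(p)])   # y = C x
    KC = K @ C
    top = Sxx + Sxu @ KC + KC.T @ Sxu.T
    # Suu = Quu - gamma * D'D must be positive definite (Assumption 1).
    return np.block([[top, KC.T], [KC, -np.linalg.inv(Suu)]])

def achieves(Qww, Qwu, Quu, D, E, K, gamma, tol=1e-9):
    """True if the LMI holds, i.e. the linear decision u = K y achieves gamma."""
    M = lmi_matrix(Qww, Qwu, Quu, D, E, K, gamma)
    return bool(np.linalg.eigvalsh(M).max() <= tol)

# Hypothetical scalar instance: L = (w - u)^2, y = w + v, u = K y.
one = np.array([[1.0]])
args = (one, -one, one, 0.0 * one, one)
print(achieves(*args, K=0.5 * one, gamma=0.6))   # feasible
print(achieves(*args, K=0.5 * one, gamma=0.4))   # infeasible
```

In the scalar instance, the worst-case ratio of $u = Ky$ is $(1 - K)^2 + K^2$, which equals 0.5 at $K = 0.5$, so the check succeeds for $\gamma = 0.6$ and fails for $\gamma = 0.4$. Iterating such a test over $\gamma$ (for example by bisection) is one way to approximate $\gamma^*$.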



Proof: 
(i) We have that

$$
\sup_{v,\, w \neq 0} \frac{L(w, 0)}{\|w\|^2 + \|v\|^2} = \sup_{w \neq 0} \frac{w^T Q_{ww} w}{\|w\|^2} = \bar{\gamma},
$$

so the ratio is at most $\bar{\gamma}$ for all $v$ and $w \neq 0$. Hence, $\gamma^* \le \bar{\gamma}$.
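(ii) Note that

$$
y = Du + Ew + v \iff v = y - Du - Ew
\implies
\frac{L(w, u)}{\|v\|^2 + \|w\|^2} = \frac{L(w, u)}{\|y - Du - Ew\|^2 + \|w\|^2}.
$$

Now introduce $x \in \mathbb{R}^n$, $n = p + q$, such that

$$
x = \begin{bmatrix} w \\ y \end{bmatrix},
$$

and

$$
Q = \begin{bmatrix} Q_{ww} & 0 & Q_{wu} \\ 0 & 0 & 0 \\ Q_{uw} & 0 & Q_{uu} \end{bmatrix},
\qquad
R = \begin{bmatrix} E^T E + I & -E^T & E^T D \\ -E & I & -D \\ D^T E & -D^T & D^T D \end{bmatrix}.
$$

Then,

$$
J(x, u) := \begin{bmatrix} x \\ u \end{bmatrix}^T Q \begin{bmatrix} x \\ u \end{bmatrix} = L(w, u),
\qquad
F(x, u) := \begin{bmatrix} x \\ u \end{bmatrix}^T R \begin{bmatrix} x \\ u \end{bmatrix} = \|y - Du - Ew\|^2 + \|w\|^2,
$$

and $y = Cx$. Hence, we have that

$$
\frac{L(w, u)}{\|v\|^2 + \|w\|^2} = \frac{L(w, u)}{\|y - Du - Ew\|^2 + \|w\|^2} = \frac{J(x, u)}{F(x, u)}. \tag{3}
$$

Then, for any $\gamma \in (\gamma^*, \bar{\gamma})$, there exists a decision function $\mu \in \mathcal{C}$ such that

$$
\begin{bmatrix} x \\ \mu(Cx) \end{bmatrix}^T (Q - \gamma R) \begin{bmatrix} x \\ \mu(Cx) \end{bmatrix}
= J(x, \mu(Cx)) - \gamma F(x, \mu(Cx)) \le 0
$$

for all $x$. Under Assumption 1 we have that $Q_{uu}(\gamma) = Q_{uu} - \gamma D^T D \succ 0$ and

$$
\begin{bmatrix} x \\ \mu(Cx) \end{bmatrix}^T
\begin{bmatrix} Q_{xx}(\gamma) & Q_{xu}(\gamma) \\ Q_{ux}(\gamma) & Q_{uu}(\gamma) \end{bmatrix}
\begin{bmatrix} x \\ \mu(Cx) \end{bmatrix} \le 0
$$

for any $\gamma \in (\gamma^*, \bar{\gamma}]$. Thus, we can apply Theorem 1 in [4], which implies that there
must exist linear decisions that can achieve any $\gamma \in (\gamma^*, \bar{\gamma}]$. By compactness, there
must exist linear decisions that achieve $\gamma^*$.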



(iii) Let $\mu(Cx) = KCx$ for $K \in \mathbb{K}$. Then

$$
\begin{aligned}
& \begin{bmatrix} x \\ KCx \end{bmatrix}^T
\begin{bmatrix} Q_{xx}(\gamma) & Q_{xu}(\gamma) \\ Q_{ux}(\gamma) & Q_{uu}(\gamma) \end{bmatrix}
\begin{bmatrix} x \\ KCx \end{bmatrix} \le 0, \quad \forall x \\
\iff \quad & Q_{xx}(\gamma) + Q_{xu}(\gamma) K C + C^T K^T Q_{ux}(\gamma) + C^T K^T Q_{uu}(\gamma) K C \preceq 0 \\
\iff \quad & \begin{bmatrix} Q_{xx}(\gamma) + Q_{xu}(\gamma) K C + C^T K^T Q_{ux}(\gamma) & C^T K^T \\ K C & -Q_{uu}^{-1}(\gamma) \end{bmatrix} \preceq 0,
\end{aligned}
$$

and the proof is complete.
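The last equivalence in part (iii) is a Schur complement argument: for $P \succ 0$, the condition $A + B P B^T \preceq 0$ holds exactly when the block matrix with blocks $A$, $B$, $B^T$, $-P^{-1}$ is negative semidefinite. Here $A$ plays the role of $Q_{xx}(\gamma) + Q_{xu}(\gamma)KC + C^T K^T Q_{ux}(\gamma)$, $B$ of $C^T K^T$, and $P$ of $Q_{uu}(\gamma)$. The following is a small numerical sanity check on random data, assuming NumPy; the helper names, the seed, and the shift parameter are hypothetical and serve only to generate both feasible and infeasible cases.

```python
import numpy as np

rng = np.random.default_rng(0)

def schur_pair(A, B, P):
    """Largest eigenvalues of A + B P B' and of [[A, B], [B', -P^{-1}]]."""
    small = A + B @ P @ B.T
    big = np.block([[A, B], [B.T, -np.linalg.inv(P)]])
    return np.linalg.eigvalsh(small).max(), np.linalg.eigvalsh(big).max()

def random_case(n=3, m=2, shift=0.0):
    """Random symmetric A (shifted), random B, and positive definite P."""
    A0 = rng.standard_normal((n, n))
    A = -(A0 @ A0.T) + shift * np.eye(n)
    B = rng.standard_normal((n, m))
    P0 = rng.standard_normal((m, m))
    P = P0 @ P0.T + np.eye(m)
    return A, B, P

# Both tests should give the same verdict on every sample.
agree = all(
    (s <= 1e-9) == (b <= 1e-9)
    for s, b in (schur_pair(*random_case(shift=t)) for t in np.linspace(-40, 5, 46))
)
print(agree)
```

The block form is the one used in the linear matrix inequality because it is affine in $K$, whereas the condensed form contains the quadratic term $C^T K^T Q_{uu}(\gamma) K C$.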



IV. Linear Quadratic Control with Arbitrary Information Constraints 
Consider the dynamic team decision problem 



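$$
\begin{aligned}
\inf_{\mu} \ \sup_{w,\, v} \quad &
\frac{\displaystyle \sum_{k=1}^{M}
\begin{bmatrix} x(k) \\ u(k) \end{bmatrix}^T
\begin{bmatrix} Q_{xx} & Q_{xu} \\ Q_{ux} & Q_{uu} \end{bmatrix}
\begin{bmatrix} x(k) \\ u(k) \end{bmatrix}}
{\displaystyle \sum_{k=1}^{M} \left( \|w(k)\|^2 + \|v(k)\|^2 \right)} \\
\text{subject to} \quad & x(k+1) = A x(k) + B u(k) + w(k), \\
& y_i(k) = C_i x(k) + v_i(k), \\
& u_i(k) = [\mu_k]_i(y_i(k)), \quad i = 1, \dots, N.
\end{aligned}
\tag{4}
$$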
Now write $x(t)$ and $y(t)$ as

$$
\begin{aligned}
x(t) &= \sum_{k=1}^{t} A^k B u(M - k) + \sum_{k=1}^{t} A^k w(M - k), \\
y_i(t) &= \sum_{k=1}^{t} C_i A^k B u(M - k) + \sum_{k=1}^{t} C_i A^k w(M - k) + v_i(t).
\end{aligned}
$$

It is easy to see that the optimal control problem above is equivalent to a static team
problem of the form (1). Thus, linear controllers are optimal under Assumption 1.

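Example 1: Consider the deterministic version of the Witsenhausen counterexample
presented in the introduction:

$$
\begin{aligned}
\inf \quad & \gamma \\
\text{s. t.} \quad & \frac{k^2 \mu_1^2(y_1) + (x_1 - \mu_2(y_2))^2}{x_0^2 + w^2} \le \gamma, \\
& y_1 = x_0, \\
& x_1 = x_0 + \mu_1(y_1), \\
& y_2 = x_1 + w = x_0 + \mu_1(y_1) + w.
\end{aligned}
$$

Substitute $x_0 = y_1$, $x_1 = y_1 + \mu_1(y_1)$ and $w^2 = (x_0 + \mu_1(y_1) - y_2)^2$ in the inequality

$$
k^2 \mu_1^2(y_1) + (x_1 - \mu_2(y_2))^2 \le \gamma (x_0^2 + w^2).
$$

Then, we get the equivalent problem

$$
\begin{aligned}
\inf \quad & \gamma \\
\text{s. t.} \quad & k^2 \mu_1^2(y_1) + (y_1 + \mu_1(y_1) - \mu_2(y_2))^2 \le \gamma \left( y_1^2 + (y_1 + \mu_1(y_1) - y_2)^2 \right).
\end{aligned}
$$

Completing the squares gives the following equivalent inequality

$$
\begin{bmatrix} y_1 \\ y_2 \\ \mu_1(y_1) \\ \mu_2(y_2) \end{bmatrix}^T
\begin{bmatrix}
1 - 2\gamma & \gamma & 1 - \gamma & -1 \\
\gamma & -\gamma & \gamma & 0 \\
1 - \gamma & \gamma & 1 + k^2 - \gamma & -1 \\
-1 & 0 & -1 & 1
\end{bmatrix}
\begin{bmatrix} y_1 \\ y_2 \\ \mu_1(y_1) \\ \mu_2(y_2) \end{bmatrix} \le 0.
$$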

For $k^2 = 0.1$, we can search over $\gamma < \bar{\gamma} = k^2 = 0.1$, and we can use Theorem 1 to
deduce that linear decisions are optimal, and can be computed by iteratively solving
a linear matrix inequality, where the iterations are done with respect to $\gamma$. We find
that

$$
\gamma^* \approx 0.0901, \qquad \mu_1(y_1) = -0.9001\, y_1, \qquad \mu_2(y_2) = 0.0896\, y_2.
$$

For $k^2 = 1$, we iterate with respect to $\gamma < 1$, and we find optimal linear decisions
given by

$$
\mu_1(y_1) = -0.3856\, y_1, \qquad \mu_2(y_2) = 0.3840\, y_2, \qquad \gamma^* = 0.3820.
$$
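The value reported for $k^2 = 1$ can be cross-checked without an LMI solver. For linear decisions $\mu_1(y_1) = a y_1$ and $\mu_2(y_2) = b y_2$, the cost in Example 1 is a quadratic form in $(x_0, w)$, and its supremum over $x_0^2 + w^2 = 1$ is the largest eigenvalue of a $2 \times 2$ symmetric matrix. The following is a minimal sketch in plain Python; the function name `worst_case_gain` and the variable names are hypothetical.

```python
import math

def worst_case_gain(k2, a, b):
    """Worst-case ratio for linear decisions mu1(y1) = a*y1, mu2(y2) = b*y2."""
    # With y1 = x0 and y2 = (1 + a)*x0 + w, the cost is
    #   k2*(a*x0)^2 + (c*x0 - b*w)^2,   c = (1 + a)*(1 - b),
    # a quadratic form in (x0, w) with symmetric matrix [[p, r], [r, s]].
    c = (1.0 + a) * (1.0 - b)
    p = k2 * a * a + c * c
    r = -c * b
    s = b * b
    # The supremum over x0^2 + w^2 = 1 is the largest eigenvalue (closed form).
    return 0.5 * (p + s) + math.hypot(0.5 * (p - s), r)

# Gains reported in Example 1 for k^2 = 1.
gain = worst_case_gain(1.0, -0.3856, 0.3840)
print(round(gain, 4))
```

The computed gain agrees with the reported $\gamma^* = 0.3820$ to within rounding of the gains. Doing nothing ($a = b = 0$) gives a worst-case ratio of 1 in the same formula, which illustrates how much the linear decisions improve on the trivial one.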

Example 2: Consider the deterministic counterpart of the multi-stage finite-horizon 
stochastic control problem that was considered in [7]:

subject to the dynamics 

yk = Xk + Vk. 

It is easy to check that $\bar{\gamma} = 1$ and $Q_{uu} - \gamma D^T D \succ 0$ for $\gamma < \bar{\gamma}$ (compare with Assumption
1). Thus, linear decisions are optimal. This is compared to the stochastic version,
where linear decisions were not optimal for $m > 2$.

V. Conclusions 

We have considered the static team problem in deterministic linear quadratic 
settings where the team members may affect each other's information. We have
shown that decisions that are linear in the observations are optimal and can be 
found by solving a linear matrix inequality. 

For future work, it would be interesting to consider the case where the measure- 
ments are given by y = Du + Ew + Fv, for an arbitrary matrix F. 